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Cross-Reference to Related Applications 

The following patent applications, all assigned to the assignee of this application, 
describe related aspects of the arrangement and operation of multiprocessor computer systems 
according to this invention or its preferred embodiment 



□ 

Q C ^ L S p atent applicqtinn-g ^nl - nnrnVinr / , by T R ^^ r g gt fll 




I! (BEA919990003US1) entitled "Method And Apparatus For Increasing Requestor Throughput 

uh / 

hi 10 By Using Data Available Withholding" was ffled on Jariliaiy ^ 2002. 

! ; y U.S. patent application serial number _/__/_ by T. B. Berg et al. 

P (BEA92000001 7US1) entitled "Method And Apparatus For Using Global Snooping To 

Q Provide Cache Coherence To Distributed Computer Nodes In A Single Coherent System" was 

hD 

q filed on January , 2002. 

ill 

15 U.S. patent application serial number//—-, by T. B. Berg et al. 

(BEA920000018US1) entitled "Multilevel Oassification Method For Transaction Address 

Conflicts For Ensuring Efficient Ordering m A Two-level Snoopy Cache Architecture" was 

filed on January 2002. 

U.S. patent application serial number _/_, by S.G. Lloyd et al. 

20 (BEA920000019US1) entitled "Transaction Redirection Mechanism For Handling Late 

Specification Changes And Design Errors" was filed on January , 2002. 

U.S. patent application serial number / by T. B. Berg et al. 

(BEA920000020US1) entitled "Method And Apparatus For Multi-path Data Storage And 

Retrieval" was filed on January /_, 2002. 



Patent Application Specification 



BEA920000021US1 




!==- 

Q 



ru 



UrS: patent apphcanon serial number / . P ay T. B. Berg et al. 
(BEA920000022US1) entitled "Distributed Allocatior/Of System Hardware Resources For 

Multiprocessor Systems" was filed on January , 2002. 

U.S. patent application serial number 1/ . by W. A. Downer et al. 
5 (BEA920010030US1) entitled "Masterless Buipng Block Binding To Partitions" was filed on 

January , 2002. / 

U.S. patent application serial number / by W. A. Downer et al. 

(BEA92001003 1US1) entitled "Building Block Removal From Partitions" was filed on January 
t 2002. 

Q 1 0 U.S. patent application serial number / by W. A. Downer et al 

15? (BEA920010041US1) entitled "Masterless Building Block Binding To Partitions Using 

■j\ / 

i : y JdefttrftersrAnd Indicators" wa£ filed on January , 20027 

ui 



Background of the Invention 
Technical Field 

1 5 The disclosure relates to hardware support for partitioning a system into multiple 

subsystems so each may provide a distinct operating system environment or partition within the 
system. The disclosure particularly relates to systems which utilize directly addressable memory 
subsystems both locally by a processor or group of processors (node) and remotely by the 
other nodes or processors in the same system. 

20 

Description of the Related Art 

With the advent of multiprocessor computer systems which share memory across the 
system, many personal computers, work stations, and server computing machines using such 
multiple processor hardware architecture have become popular. Multiprocessor architecture 
25 has become an important feature because of the shared processing capabilities of coordinated 
multiprocessor systems which enhance performance by sharing memory across an entire 
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system, all of which may be accessible by multiple groups of multiprocessors (nodes) combined 
in a single system. Multiple processor computers can include large numbers of individual 
processors arranged in separate groups, each of which are associated with particular memory 
systems but all of which are coordinated with crossbar switch architecture systems allowing for 
5 data, data tagging and address control across the entire system. In the usual system 

architectural arrangement, processing boards, each of which has at least one processor and 
associated main memory, are connected by a high-speed crossbar switch communicating with 
each group of processors so as to maintain cache coherency among the various processors 

Li* 

across the system. 

4f 1 0 Users of a multiprocessor system have varying needs for the level of computing power 

ISS3 

ii! available from that system, or wish to run different operating systems simultaneously on the 

. ?% 

S.J. 8 

f || system. It is desirable to have the capability of dividing such a system running a single operating 

isi 

l"''"' system instance into multiple subsystems which each provides its own distinct operating system 

environment By partitioning a multiprocessor system in such a fashion, the system could be 

Q 1 5 reconfigured on demand to adjust the allocation of system resources across the operating 

□ system partitions to provide the appropriate sizing of resources to each partition. Various 

UK t 

patents issued relating to the field of the present invention teach multiple processor systems 
which have memory resource management techniques. The disclosures considered below 
illustrate the background art against which the advantages of the present invention will become 

20 apparent and be fully appreciated. 

Patent 5,905,998 issued to Ebrahim, et al., defines a system in which system memory 
and multiple processor units are each distinct rather than in a system where memory and 
processors are grouped together into nodes, each containing multiple processors and memory 
and where the mobile nodes exist in the operation of the system. This disclosure also requires 

25 the use and definition of master and duplicate cache tags. Patent 5,905,998 does not teach a 
method of partitioning of a processor system. 

Patent 6,088,770 issued to Tarui, et al., describes a multiprocessor system that is 
partitioned into various nodes which may allow different operating systems to function similar to 
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the present invention. This disclosure utilizes all the system memory shared as one, a distinct 
disadvantage. Further, 6,088,770 does not teach coherent sharing of memory between 
partitions. 

Patent 6,035,378 issued to James teaches memory page access monitoring logic that 
5 may provide information about memory access patterns, but does not teach the subject of 
partitioning memory systems. 

Patent 6,075,938 issued to Bugnion, et al., defines the use of layers of software that are 
inserted between the operating system and the application software such as to allow multiple 
|r operating systems to run concurrently on the system through the use of a virtual machine 

'"I 10 monitor. 

4* 

ijfi Patent 5,926,829 issued to Hagersten, et al., teaches a method for selecting between 

p |j two different caching modes between NUMA and COMA systems. The overall architecture 

Is! 

disclosed in Patent 5,926,829 employs distributed memory directories and interfaces and does 
! !5 - not to teach partitioning as presently disclosed. 

□ 1 5 Patent 5,893,144 issued to Wood, et al., is almost identical to the disclosure and 

□ teachings set forth in Hagersten, et al. above. 

1-5 S 

' u Patent 5,887,138 issued to Hagersten, et al., is similar to the '829 and '144 patent 

issued to Hagersten and Wood above. 

Patent 6,049,853 issued to Kingsbury, et al., defines a method in which portions of 
20 system memory used to store program code are copied into other locations in memory so that 
two users of the system may both have the code as local to increase system performance. This 
patent does not teach partitioning system resources. 

Patent 3,641,505 describes an older method of partitioning a system by using a less 
sophisticated method of providing large crossbar systems which interconnect processor 
25 modules of a particular type to other modules of types that require communication paths. 

Controls of the crossbar in this disclosure allows or disallows communications between them, 
thus effecting a crude partitioning method. 
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Patent 6,021,479 issued to Stevens defines a memory management system which 
effects memory management policies within the system. This disclosure provides for 
optimization of memory access latencies on a single system utilizing one operating system at a 
time. 

Summary Of The Invention 

A first aspect of the present invention is a multiprocessor computer system having at 
least two processing nodes, each node including at least one processor. Portions of a shared, 
distributed system memory are coupled to the processors. A communication pathway including 
□ a central hardware device connects a first requesting node which is requesting data, and a 

saps 

Ljl 10 second target node which is the destination of the requested data. The central hardware device 

Hp 

ry transmits requests from the said requesting node to the said target node but not to any other 

^ node. Preferably (but not necessarily), the nodes include local memory which is directly 

addressable both locally by the node associated with said memory and remotely by any other 
node. Also preferably, the central hardware device stores information for determining which 
q 1 5 nodes or processors are storing copies of one or more identified data elements in each said 
! ' 1 node's local memory. 

Another aspect of the invention is a method for partitioning into two or more partitions 
the resources of a multiprocessor computer system comprising a plurality of processing nodes; 
a shared, distributed system memory; and a communication pathway connecting the processing 
20 nodes. Each one of the processing nodes includes at least one processor; and a portion of said 
shared system memory coupled to the processor and the communication pathway. The 
communication pathway includes communications ports each dedicated to communicating with 
one of the processing nodes. The method assigns a physical address to at least two of the ports 
and a logical address to each physical address, and stores the logical address in a memory. 
25 Each of the processing nodes is defined to correspond to no more than one partition, and data 

is routed within the system to correspond with the addresses of the resources defined as a 
partition. 
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It is the object of the present invention to provide a means of hardware support for 
partitioning a multi-node processor system by providing data registers enabled or disabled in 
accordance with definitions of which nodes are defined within a given partitioa It is also the 
object of the present invention to provide a method of hardware support for partitioning a 
computer system utilizing multiple microprocessor groups or nodes, such that each partition 
may provide a distinct operating system environment while sharing the resources of other nodes 
in the entire system. It is further the object of the present invention to provide a method for 
coherent sharing of memory subsystems between nodes which are sharing a partition, where 
each partition has a portion of the total memory which is private to such partition and its range 
j 1 0 of addresses can be the same as another partitions similar memory addresses without allowing 
I II access between such separate memory systems. It is also an object of the present invention to 

i M 

provide a means to generate partition write protection register entries such as to enhance data 

- overwrite protection by disabling the system's ability to write or accept data to or from a 

□ partition for which the data is not intended. 

j 15 Other features and advantages of this invention will become apparent from the following 

ji j detailed description of the presently preferred embodiment of the invention, taken in 

' M conjunction with the accompanying drawings. 



ru 
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Brief Description Of The Drawings 
Fig. 1 is a block diagram of a typical multiprocessor system utilizing a data tag and 
20 address crossbar system in conjunction with a data crossbar system which the present invention 
may be employed to partition groups of processors to operate with independent operating 
systems. 

Fig. 2 is a diagram presenting an overview of the memory address map for the first 
processor group or quad in a given partition created by the invention. 
25 Fig. 3 is a logic diagram illustrating a physical port's hardware support for partitioning 

in accordance with the present invention. 
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Fig. 4 is a logic diagram illustrating the global register block which includes the partition 
error detection logic used in carrying out the present invention. 

Fig. 5 is a block diagram of the tag and address crossbar system connecting each 
quadrant or node in a multiprocessor system in which the invention is used. 
5 Fig. 6 is a block diagram of the mapping of the remote cache tags. 

Fig. 7 is a logic diagram illustrating the global configuration register logic used to 
generate the partition vector bits used in carrying out the present invention. 

Fig. 8 is a diagram illustrating the contents of a partition configuration register for 
j:« system port 0 used in accordance with the present invention. 

f\ 1 0 Fig. 9 is a diagram illustrating the partial contents of the global configuration register 



: =3 



<J3 



used in accordance with the present invention. 



I'll 

U J Detailed Description Of The Preferred Embodiment 



p Overview 

Ink 

j«* The preferred embodiment of this invention reconfigures or adjusts the allocation of 

~ j 1 5 system resources across operating system partitions within the same computer system to 

W provide the appropriate sizing of resources to each partition in a multiprocessor system 

containing multiple processors or multiple nodes. The preferred embodiment includes means to 
enforce and protect the isolation of each partition by creating write protection register entries. 
Within a multiprocessor system, at certain points where data is requested by a 
20 component or subsystem within the system, the identities of the requesters of the data are 
compared to a list of partitioned identifiers and partitioned membership lists, where the 
membership or the identities of the requesters are stored, depending on the configuration of the 
partitions. Requests are processed only in the case where the requestor of data accesses 
resources which belong to a given defined partition to which the requestor is stored on a 
25 membership list. Requests for data are denied to resources which are not in the membership list 
of the partition to which the requestor is a member. System resources in a multi-node system 
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are allocated dynamically between the partitions created and provides the means for protecting 
or isolating the partitions for accessing resources which are not within their partition. 

Partition configuration register means are used for each defined port providing for 
defined bit entries exclusively reserved for defining a given partition's enablement and a given 
5 partition's write protection as well as a port write protection. The preferred embodiment 

includes partition protection means comprised of register entries (bits) which enhance isolation 
of each partition by defining allowable access or writes from only those defined quads or nodes 
assigned to a given partition. A global configuration register means is used to configure system- 
wide parameters which include defined bit entries exclusively for use in defining write protection 

1 0 for each defined partition. 

Effective isolation of defined partitions within a multi-node system is provided 
preventing the possibility of overwriting data generated from an operation in a different partition 
from overriding data in a given partition to which said data was not meant to be written. 
Apparatus in the tag and address crossbar system of the multiprocessor system disclosed 

1 5 includes a tag comparator module and a dispatch module each separately capable of comparing 
a given tag's partition vector or requesting port to the required partition vector of an intended 
output port to prevent output of the tag to a port not defined for the given partition. Entries in a 
system global configuration register as well as registers which store the partition configuration of 
the various nodes are also disclosed. 

20 Technical Background 

The preferred embodiment of this invention is applicable to multiprocessor computer 
systems which utilize multiple microprocessor groups or nodes, each of which is provided with 
its own input and output interfacing and memory systems, including memory control. Such 
multiprocessor systems typically partition physical memory associated with one local group of 
25 microprocessors into locally available memory and remote memory or remote cache for use by 
processors in other processor groups within a system. Multiple node microprocessor systems 
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utilize tag and address crossbars as well as data crossbars to interconnect the various nodes or 
groups of microprocessors, thereby providing communications between such nodes. 

In the preferred embodiment, a total of sixteen microprocessors are configured into four 
separate quads, each quad also being referred to as a node of the system. The nodes in the 
5 preferred embodiment are therefore a total of four, referred to in the disclosure as nodes 0, 1 , 2 
and 3. The method of the preferred embodiment defines two translations which are used to 
resolve the target physical node identification (ID) for a given request for data in relation to 
partitioned membership and logical node ID. The first translation defines all the physical nodes 
within a system that are members of a partition, which has a unique partition ID (PaiUD). The 

1 0 second translation steers requests from a given partition to the physical target node to which the 
requested address refers. In addition to the target physical node, the physical node IDs of 
other interested nodes may need to be calculated. 

As used in the following description, ID refers to a value or number that identifies a 
system node or resource owned by or located within the same node. In the preferred 

1 5 embodiment, a node (or quad) includes four processors, associated memory and an 

input/output subsystem all interconnected through a system control agent, or controller all of 
which operate as a modular unit within the entire computer system. The memory control which 
interconnects the components within one group of processors, being one node, is 
interconnected to all the other node control agents through a tag and address crossbar system, 

20 as well as a data crossbar system. 

A physical node ID is fixed based on the physical placement of the node in the 
interconnect crossbar systems, and every node has a unique physical node ID value that is 
consistent with its physical interconnection to the overall system. The physical node ID does 
not change if the node changes its partition ID or logical ID. In the invention, a partition ID is a 

25 virtual identifier that all physical nodes in a partition will share. Nodes with any other value for 
their partition ID are by definition not a member of the given partition. If any two physical 
nodes share the same partition ID, they are both members of the same partition within the 
system. A logical ID is the node offset for a given address relative to the partition where the 
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address was requested The logical ID is unique to nodes within its own partition, but not 
necessarily unique relative to nodes in other partitions. In the implementation of the method 
disclosed herein, an address of a data request defines the logical ED of the node to which that 
address refers. 

5 The memory control always issues requests using physical addresses relative to its 

partitioa The physical address is itself relative to its partition number. Since more than one 
partition may be using the tag and address crossbar at a time, the partition number must be used 
to distinguish one partition's physical address from another's. Therefore, each tag and address 
crossbar physical port will have a two-bit partition number programmed to allow incoming 

1 0 requests to label themselves with their partition number. The said partition number is 

concatenated with address bits 36:35 (the logical node ID from the systems memory map) to 
form a lookup index that is used to look up the physical ID number that is used to identify the 
node where the address is defined as Home. The tag and address crossbar uses the physical 
node ED to route requests, replies, responses, and data associated with the request. The tag 

1 5 and address crossbar uses these physical IDs to identify source and target nodes (called snID 

and tnID) and to select tag quadrants for tag accesses. 

Tag accesses must be aware of partitions when they look for matches, perform 
updates, and check for conflicts. The tag and address crossbar has a hardwired assignment of 
port numbers to physical pins on the typical application specific integrated circuit device 

20 (ASIC) (used as a memory control agent interconnecting the resources of one node to the tag 
and address crossbar), therefore port number and physical node ID are equal. Each port of the 
tag and address crossbar has two configuration variables: Assigned Partition and Logical Node 
ID. All ports in the same partition have the same value for Assigned Partition. Within a 
partition, the Memory control on a port is home to a portion of the memory space; the unique 

25 space assigned to that Memory control is programmed into the Logical Node ID. Finally, the 
Partition Vector is a four-bit vector that identifies all the quads present in the partition to which 
that port belongs. The Partition Vector is actually calculated from the first two configuration 
variables of the four ports. The example below in Table 1 shows that Partition 0 consists of 
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port 2 only, and that Partition 1 consists of ports 1, 3, and 0 as logical nodes 0, 1, and 2. 
Therefore, port 1 is the logical node 0 in partition 1 (is home to the address range of 0 to 32 
GByte), and port 3 is logical node 1 in partition 1 (is home to the address range 32-64 Gbyte). 

Table 1 

5 Example of Partition Vectors 



I 

\ 

I 10 

s 



Port 


AssignedP 


Logical 


Partition 


Comment 


ID 


artition 


Node ID 


Vector 




0 


0 1 


1 0 


1011 


Partition 1, logical id 2, physical id 0 


S 1 


0 1 


00 


10 11 


Partition 1, logical id 0, physical id 1 


2 


00 


00 


0 100 


Partition 0, logical id 0, physical id 2 


3 


0 1 


0 1 


10 11 


Partition 1, logical id 1, physical id 3 



The tag and address crossbar will not propagate requests to ports that are not included in the 
Partition Vector of the port where the request was made. It may propagate requests and make 
replies to any or all of the ports that are included in the Partition Vector, as circumstances 

!) 1 5 require. In the above example, a request from port 1 can be propagated to ports 0 and 3 

1 

J because they are in port 1 's Assigned Partition. The request cannot be propagated to port 2, 

nor can the tag for port 2's Remote Cache be modified in any way due to port 1 's request. 

Technical Details 

The preferred embodiment relates specifically to a system and a method for use in a 
20 multiple processor system which utilizes a tag and address crossbar system in combination with 
a data crossbar system, together with associated memory and control means comprising a data 
processing system. The disclosed embodiment is a means and a method which allows 
partitioning of the entire system such that distinct operating systems may run simultaneously 
across the system. The method and means demonstrated below permits a system using multiple 
25 processors with a processor group interface control system, and an address tag and crossbar 

system, to partition one or more processor groups into partitions by dividing resources such as 
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system memory across the entire system but function independently for purposes of running 
separate operating system software on each separately partitioned group of one or more 
processors. System memory is divided among partitions but shared by the partition to which it 
is allocated. 

5 Fig. 1 presents an example of a multiprocessor system according to the preferred 

embodiment, showing a multi-processor system which utilizes four separate central control 
systems (control agents) 66, each of which provides input/output interfacing and memory 
control for an array 64 of four Intel brand Itanium class microprocessors 62 per control agent 
!«J 66. In many applications, control agent 66 is an application specific integrated circuit (ASIC) 

■% 1 0 which is developed for a particular system application to provide the interfacing for each 

isprs 

IJl microprocessor bus 76, each memory 68 associated with a given control agent 66, PCI 

'£ 

111 interface bus 21, and PCI input/output interface 80, along with the associated PCI bus 74 

UJ 

I " which connects to various PCI devices. Bus 76 for each microprocessor is connected to 

control agent 66 through bus 61 . Each PCI interface bus 2 1 is connected to each control agent 

O 15 66 through PCI interface block bus 20. 

Q Fig. 1 also illustrates the port connection between each tag and address crossbar 70 as 

ru 

well as data crossbar 72. As can be appreciated from the block diagram shown in Fig. 1, 
crossbar 70 and crossbar 72 allow communication between each control agent 66, such that 
addressing information and memory line and write information can be communicated across the 

20 entire multiprocessor system 60. Such memory addressing system is necessary to communicate 
data locations across the system and facilitate update of control agent 66 cache information 
regarding data validity and required data location. 

A single quad processor group 58, also referred to as a quad 58, is comprised of 
microprocessors 62, memory 68, and control agent 66. In multiprocessor systems to which the 

25 present invention relates, quad memory 68 is usually Random Access Memory (RAM) 

available to the local control agent 66 as local or home memory. A particular memory 68 is 
attached to a particular controller agent 66 in the entire system 60, but is considered remote 
memory when accessed by another quadrant or control agent 66 not directly connected to a 
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particular memory 68 associated with a particular control agent 66. A microprocessor 62 
existing in any one quad processor group 58 may access memory 68 on any other quad 
processor group 58. NUMA (Non-Uniform Memory Access) systems typically partition 
memory 68 into local memory and remote memory for access by other quads. 
5 In a typical multiprocessor system using distributed memory as shown in Fig. 1, the 

data tag and address crossbar 70 allows partitioning a total of sixteen individual processors 62 
shown in groups of four, into between one and four separate partitions. In describing the 
invention, reference will be made to a quad, also illustrated in the system diagram of Fig. 1. A 
quad, also a node in the present invention, operates as a separate node, each quad or node 

1 0 being interconnected to the rest of the system through ports that are associated with each quad. 
The memory map of a single quad of a multiprocessor system which uses the partitioning 
method is shown in Fig. 2. Each of the four quads 58 has one 32 GByte memory space 
allocated to it, of which 2 GByte is memory-mapped input/output (MMIO), CSRs (control and 
status data registers), and other non-memory addresses. Each quad 58 is hardwired to a port 

15 of crossbar 70. Each quad 58 is configured with a partition ID and a logical ED. The partition 
ED (0, 1, 2, or 3) designates the quad's membership in an operating system and shared memory 
space. The logical ID (0, 1 , 2, or 3) designates the quad's placement in the memory space of 
its partition. The logical ID specifies the physical address range that quad 58 treats as its local 
memory within that partition, i.e., the memory stored in its own memory 68. When a processor 

20 62 requests an address that has the same logical region as its quad's logical ID, then the 
address is local to the processor or "home", otherwise it is "remote" to that processor. 

The memory map of all quads in a partition is orthogonal to all addresses in another 
partition. System 60 can therefore be configured in any possible arrangement of partitions 
provided that each partition has a unique partition ID and each quad 58 has a unique logical ID 

25 within that partition. It should be appreciated that quad 58 is the smallest configurable unit in 
terms of partitioning a system using the method of the invention, as individual processors, IO 
(input/output) devices, or portions of memory 68 in quad 58 cannot be configured to different 
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partitions. Each quad is allocated a 128 MByte Remote Cache memory, which resides in the 
main memory 68 of each quad 58. Crossbar 70 maintains tags for each quad's Remote Cache. 

As a system is configured with virtually identical processor groups (nodes) or quads 58, 
the entire system may be partitioned as a single system or up to four separate partitioned 
5 systems using the method disclosed. In the preferred embodiment, the maximum total number 
of quads 58 is four, as configured in Fig, 1. Every port of data tag and address crossbar 70 is 
assigned to one of four control agents 66 by virtue of its physical connection between agent 66 
and crossbar 70. Interconnections between tag and address crossbar 70 and data crossbar 72 
to each of control agents 66 are accomplished through bus 71. Shown in Fig. 1 as a 

1 0 connection from tag and address crossbar 70 and data crossbar 72 to the control agent 66 in 
quad one, the bus is also referred to as a port. Though shown only at quad one, the 
configuration of bus 71 is duplicated for each quads 58 as can be appreciated by the 
connections for ports 0, 1 , 2 and 3 shown in Fig. 1. Bus 73 is the portion of bus 71 that 
connects control agent 66 to tag and address crossbar 70. Bus 75 is the portion of bus 71 

1 5 which connects the data crossbar 72 to each control agent 66. Each of the quads of the system 

demonstrated in Fig. 1, communicate to the remaining portions of the system through tag and 
address crossbar 70 as well as data crossbar 72 through channels defined as ports. Ports 0, 1, 
2 and 3 are all shown on Fig. 1 interconnecting the crossbar systems with the quadrant or node 
control agent 66 through input and output portions of each port, interconnecting each crossbar 

20 to each given quad. All of the processor groups or quads 58 in Fig. 1 are connected in a 

similar fashion, as can be appreciated from the figure, utilizing interconnect bus 71 as shown in 
port 1 of Fig. 1. The crossbar systems, including the ports interconnecting the crossbars with 
each of the quads 58 are essentially a communication pathway connecting the processing nodes 
so that they may all share access to data stored throughout the system. 

25 Fig. 5 illustrates internal logic of tag and address crossbar 70 shown in Fig. 1. Input 

40 for port 0, input 41 for port 1 , input 42 for port 2, and input 43 for port 4 illustrate part of 
the communications pathway connections each control agent 66 in each quad or node into tag 
and address crossbar 70. Likewise, Fig. 5 illustrates port 0 output 45, port 1 output 46, port 2 
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output 47 and port 3 output 48, each said output also illustrated on the entire system block 
diagram shown in Fig. 1. Tag look-up registers which function with tag and address crossbar 
70 are shown at 81a and 81b. Registers 81a and 81b are identical except that they are 
associated with an even pipeline and odd pipeline for tag processing as illustrated in Fig. 5. 
5 The dual pipeline design is provided to reduce latency in the system by assigning processing to 
even numbered tags to the even pipeline and odd numbered tags to the odd pipeline so that 
simultaneous processing may occur. 

Continuing with Fig. 5, input 40, 41, 42 and 43 are each introduced through a buffer, 
are operatively connected to even input multiplexor 50, and odd input multiplexor 51, the 

1 0 appropriate multiplexor (mux) being selected in accordance with the even or odd relationship 
with the input tag. Each multiplexor 50 and 51 serves to serialize the flow of tags from the four 
inputs. The outputs of muliplexor 50 and 51 are sent to another multiplexor to be sent 
ultimately to tag look-up registers 81a and 81b. Even pipeline logic 52 and odd pipeline logic 
53 evaluates the tags being presented and the request type to generate an output response and 

1 5 requests for ports that are connected to a defined quad within its partition. The resulting output 

entries are buffered in the dispatch buffer 54 and 55, each of which is a first in, first out (FIFO) 
type buffer. Dispatch buffers 54 and 55 decouples timing variances between the tag logic 
shown and the output selection logic. Entries are stored in dispatch buffers 54 and 55 in first in, 
first out orderuntil they can sent to the destination ports, being output 45, 46, 47 or 48, 

20 representing one output to each port or quad. 

Tag look-up register 81a and 81b, identical in configuration, are made up of four 
SRAMS chips, each being 512 kbits by 16 bits in size. Tag look-up register 81a is connected 
through line 82a to even tag comparator and dispatcher 84. Though shown as one connection 
in Fig. 5, connection 82a is actually four paths, each corresponding to inputs 0, 1, 2 and 3 from 

25 each port as described. Register 81b, connected to the odd tag comparator and dispatcher 85 
through connection 82b is essentially identical in function. Path 82b is likewise comprised of 
four paths, each corresponding to a port. Tag look-up registers 81a and 81b is comprised of 
external memory chips which interfaces with crossbar 70, and are used to store the tag and 
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state information for all of the remote cache tags in the entire system. Such information is not 
directly accessable by memory control agent 66, so all cacheable transactions generated in 
control agent 66 must access crossbar 70 to access or "snoop" crossbar 70 f s remote cache 
tags (RCTs). The physical configuration of register 81a and 81b is illustrated in the block 
5 diagram shown in Fig. 6. As shown in Fig. 6, register 81a and 81b is implemented with 

syncronous static random access memory chips (SSRAM) which operate at the internal clock 
frequency of crossbar 70, being 133 MHz in the present invention. As can be seen also in Fig. 
6, there are two groups of external SSRAMs, the groups being divided to odd and even 
pipelines as shown on Fig. 5. Each group of RCTs is split into four separate chips, with each 
1 0 separate chip representing a physical port of crossbar 70. As there are a total of four ports in 
the preferred embodiment as shown in the system diagram of Fig. 1, it can be appreciated that 
each port corresponds to a potential logical quad in the present invention, as earlier described. 
Therefore, each port of the RCT interface represents the RCTs for a physical quad's remote 
cache. 

1 5 One means of partition data overwrite protection is handled by tag comparator and 

dispatcher 84 and 85 in Fig. 5. Transactions being tracked by tag look-up register 81a and 
81b can never be output to path 82a or 82b if such transactions are not defined to be in the 
partition to which such output would be writing. For example, in the four node system 
illustrated in Fig. 1, if two partitions were defined, it can be seen that input 40 and 41 may 

20 belong to one partition while 42 and 43 are defined in a second partition. Tags being input into 
40 and 41, through use of the present invention, are processed and eventually compared to 
determine whether they may be outputted from register 81a and 81b through connection 82a 
and 82b. In a two-partition system, the invention prevents possible outputting through those 
connections between registers 81a and b and comparators 84 and 85 on lines which would be 

25 defined strictly for input 42 and 43. Thus a level of protection for existing partitions is 

presented. Tag look-up register 81a and 81b keeps track, on a global basis, of a particular 
transaction's membership in a partition. 
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Further, considering Fig. 5, data destined for a particular partition outputted from 
comparator and dispatcher 84 or 85 enters dispatch buffer 54 or 55, depending on whether the 
data is identified as even or odd as described above. Dispatch buffers 54 and 55 further 
provide partition overwrite protection by preventing a tag which is not defined as being within a 
5 particular partition from being dispatched to an output not defined as being within the partition 
in which the tag belongs. Accordingly, a second opportunity to protect separate partitions from 
errors occurring because of some failure or breakdown of the hardware or logic is provided. 

Each port has configuration information that assigns its partition ID and logical ID 
(address range). The partition membership of each quad 58 is defined by the value of the 
10 partition ID field, logical address range is defined by the setting of the logical ID field Each 
port has a Commit and a Write Protect bit. The Commit bit verifies that the partition ID is 
valid; if Commit is not set, the partition ID is invalid and the effected quad 58 is thereby not a 
member of any defined partition, and thereby shares nothing with any other quad 58 in system 
60. 

1 5 The Write Protect bit allows a quad 58 to lock itself into a partition so that no other 

quad 58 can modify its configuration registers; a quad sets its Write Protect bit when it 
completes its boot sequence and partition integration tasks. When the Write Protect bit is set, 
any individual quad 58 always controls its own configuration and other quad 58 requesting a 
configuration change must do so by requesting the controlling quad 58 to make a configuration 

20 change. 

When booting the system with quad 58, crossbar 70 provides a CSR (a hardware 
storage register accessible to software to and from which software can read and write) 
Mapping Flag bits that allow communications between quads 58 even when such quads are in 
different partitions. Eight flag bits for each quad 58 may be written only for that effected quad 
25 58 and all thirty-two flags can be read by any quad 58. The address spaces for different 

partitions cannot be violated by simple address pointer errors because crossbar 70 prevents 
requests from being sent to any quad 58 in another partition based on the partition ED of the 
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inbound data request Therefore, in accordance with the invention, inter-partition accesses 
could only occur because of possible hardware faults or scan operations. 

Fig. 3 illustrates the logic utilized in supporting the partitioning of a physical port Fig. 3 
discloses the Home Port ID block 90 (Hop ID) which takes an incoming request of a target 
5 address and uses the logical target ID to determine the physical port that determines the 

physical port of which that request is home. Address bits [36:35] give the logical target ID of a 
request. They are compared to the programmed logical home ID of each of the potential 
output ports. PlogID is a vector coming from the CSR block. It gives the 2-bit logical home 
ID of each physical port that is programmed into a Partition Configuration Register. The 

1 0 comparison should result in at least one match. However, the same logical home ID may also 
exist in another partition, in which case there will be two matches. Therefore, it must be 
assured that the incoming and outgoing ports are in the same partition. A 4-bit partition vector 
is received by each instance of Hop ID 90. It should be understood that the asteric symbol is 
used in the present disclosure to represent a multi-instant reference, sometimes also refered to 

15 in the art as a wild card character. 

Fig. 7 illustrates the logic diagram of the partition vector generated by the global 
configuration register in the invention. Partition vector 94, consisting of 16-bits, is comprised of 
four separate vectors, each made up of 4-bits. P*pvec is a separate 4-bit vector defining each 
port in the system as it relates to a particular partition. POpvec 95 defines port membership in 

20 partition 0, Plpvec 96 defines membership in partition 1 , P2pvec 97 defines membership in 
partition 2, and P3pvec 98 defines membership in partition 3 - all of which is illustrated in the 
logic diagram shown in Fig. 7. 

Each P*pvec shown in Fig. 7 is connected to P*pvec input 99 shown on Fig. 3. 
Homeport ID block 90 shown in Fig. 3 is reproduced for each port, being a total of four for 

25 the entire system disclosed. It will be appreciated that the 4-bit output of P*pvec shown in Fig. 

7 will be connected in each instance to input 99 on Fig. 3. Each SNID 83 in Fig. 3, comprised 
of 2-bits, defines to which physical node block 90 is attached, being node 0, 1 , 2, or 3 in the 
system. Address 86 in Fig. 3 is comprised of the logical ID of the target address, comprised of 
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2-bits. P*pvec has each bit set or cleared to indicate which physical ports are in the partition to 
which this inbound port belongs. For example, if partition 0 includes physical ports 0 and 2 
then P*pvec would be 0101b. The partition vector is used as a mask to qualify the 
comparisons of the address bits and PloglD. There should be one and only one remaining 
5 match, which is fed into a 4:2 encoder to produce the home node ID (HnID). The 4:2 encoder 

values are shown in Table 2. 



Table 2 
4:2 encoder values 



one-hot 


HnID 


input vector 




0001 


00 


0010 


01 


0100 


10 


1000 


11 


others 


error 



The HnID is the physical port ID that corresponds to the logical target ID. P*IDerr is 
asserted if there is not exactly one match and P*Commit is asserted. P*Commit comes from 
the CSR block and is set when this inbound port is committed to a partition. The HnID 
indicates which physical port has been configured to be home for that address in this partition. 
20 It is used to route requests, replies, responses, and data associated with the request to the 
appropriate tag and address crossbar 70 output port. HnID is delivered to the Input mux 
during the AD phase. 

Continuing with the example above, assume the logical home IDs are configured so that 
Port 0 has logical ID 00b, Port 1 logical ID 10b, Port 2 logical ID 10b and Port 3 logical ID 
25 01b. PloglD will be 68h. A transaction coming into Port 0 with addr[36:35] of 10b is 

compared with PloglD. There will be a match of the logical ID of both Port 1 and Port 2 and 
the vector out of the comparators will be 01 10b. After it is summed with the partition vector of 
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0101b, the result is the one-hot vector 0100b, from which the encoder provides a HnID of 
10b. The match with Port 1 that is not in the same partition has been masked out. The 
transaction will be processed knowing that the home port is physical Port 2 based on the HnID 
of 10b. If the HnID is equal to the snID 83, (physical node ID), then the request is local, and 
5 LocalRemote is asserted to the TypeCk block. Otherwise it is remote and LocalRemote is de- 
asserted. It is delayed by a clock to arrive during the ID phase of the transaction. Table 3 
illustrates the content of the partition configuration register, a graphical representation of which 
is shown in Fig. 8 as an example of the register for port 0. 



Table 3 

1 0 Partition Configuration Register 



Name 


Bits 


Read/Write 


Definition 


ron u r aruuon 
Write Protect 


i 
i 


Rd/Wr 


wncn bcij aiiu run. v v^uiimuiicu io oci, pivvdiio aity 
Port Committed bit from transitioning from 0 to 1 if the 
associated Partition # bits match the Port 0 Partition bits 
unless the CSR write is from a port already committed 
to the partition. 


Port 0 Write 
Protect 


1 


Rd/Wr 


Prevents changes to the partition information associated 
with this port. If this bit is set CSR writes only from this 
port may change the contents of the bits associated with 
this port, including this Write Protect bit. 


PortO 
Committed 


1 


Rd/Wr 


Indicates that the Quad is committed to the partition 
(below). 


Port 0 Partition 

# 


2 


Rd/Wr 


Partition number to which the Quad attached to this 
port belongs. 


Port 0 Logical 
Node# 


2 


Rd/Wr 


Logical node number within the partition that this port 
represents. This field determines the address range 
owned by the Quad attached to this port 


PortO Enable 


1 


Rd/Wr 


When set this bit enables port 0 subject to certain 
conditions. If clear, port 0 is disabled and will not 
respond to port 0 inbound transactions. 
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Port 1 Partition 
Write Protect 


1 


Rd/Wr 


When set, and Port 1 Committed is set, prevents any 
Port Committed bit from transitioning from 0 to 1 if the 
associated Partition # bits match the Port 1 Partition bits 
unless the CSR write is from a port already committed 
to the partition. 


Port 1 Write 
Protect 


1 


Rd/Wr 


Prevents changes to the partition information associated 

vintVi th-ic r\r\rt Tf tViic Kit io CAt /^QT? \x/n"tAC rvnK/ "firviYI tVlIC 

wim mis pun. 11 ixuo on lb sci wiiico uiuy num uiio 
port may change the contents of the bits associated with 
this port, including this Write Protect bit. 


Port 1 
Committed 


1 


Rd/Wr 


Indicates that the Quad is committed to the partition 
(below). 


Port 1 Partition 
# 


2 


RH/Wr 

JMJ/ VV i 


Partition number to which the Quad attached to this 
port belongs. 


Port 1 Logical 
Node# 


2 


Rd/Wr 


Logical node number within the partition that this port 
represents. This field determines the address range 
owned by the Quad attached to this port 


Port 1 Enable 


1 


Rd/Wr 


When set this bit enables port 1 subject to certain 
conditions. If clear, port 1 is disabled and will not 
respond to port 1 inbound transactions. This bit is 
cleared at power-on reset 


Port 2 Partition j 
Write Protect 


1 


Rd/Wr 


When set, and Port 2 Committed is set, prevents any 
Port Committed bit from transitioning from 0 to 1 if the 
associated Partition # bits match the Port 2 Partition bits 
unless the CSR write is from a port already committed 
to the partition. 


Port 2 Write 
Protect 


1 


Rd/Wr 


Prevents changes to the partition information associated 

with tViiQ rv\rt Tf thiQ Hit i<5 <spt f^SR writf*^ nnlv from this 

WILLI ULLo LJvIl. 11 UllO Ult ID Ovl WlllvO VJ111Y JJ.VJ11I U.LLO 

port may change the contents of the bits associated with 
this port, including this Write Protect bit. 


Port 2 
Committed 


1 


Rd/Wr 


Indicates that the Quad is committed to the partition 
(below). 


Port 2 Partition 
# 


2 


Rd/Wr 


Partition number to which the Quad attached to this 
port belongs. 


Port 2 Logical 
Node# 


2 


Rd/Wr 


Logical node number within the partition that this port 
represents. This field determines the address range 
owned by the Quad attached to this port 


Port 2 Enable 


1 


Rd/Wr 


When set this bit enables port 2 subject to certain 
conditions. If clear, port 2 is disabled and will not 
respond to port 2 inbound transactions. 
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Port 3 Partition 
Write Protect 


1 


Rd/Wr 


When set, and Port 3 Committed is set, prevents any 
Port Committed bit from transitioning from 0 to 1 if the 
associated Partition # bits match the Port 3 Partition bits 
unless the CSR write is from a port already committed 
to the partition. 


Port 3 Write 
Protect 


1 


RdAVr 


Prevents changes to the partition information associated 
witn mis port n inis oil is sex u^jy wnies omy irom inis 
port may change the contents of the bits associated with 
this port, including this Write Protect bit. 


Port 3 
Committed 


1 


RdAVr 


Indicates that the Quad is committed to the partition 
(below). 


Fort i rartraon 

# 


2 


Kd/Wr 


r aration number to wnicn tne v^uaa attacnea to inis 
port belongs. 


Port 3 Logical 
Node# 


2 


RdAVr 


Logical node number within the partition that this port 
represents. This field determines the address range 
owned by the Quad attached to this port 


Port 3 Enable 


1 


RdAVr 


When set this bit enables port 3 subject to certain 
conditions. If clear, port 3 is disabled and will not 
respond to port 3 inbound transactions. 



Table 4 is a table of possible partition member register entries. The partition member register 
is read only. Its contents are calculated from the partition configuration register shown above in 
Table 3. It will be appreciated by review of Table 4 that the entries define the partition 
1 5 membership of each port, and therefore each quad. 



Table 4 
Partition Member Register 



Name 


Bits 


Read/Write 


Definition 


Partition 0 - 
PortO 


1 


RdOnly 


Set if Port 0 is a member of partition 0 


Partition 0 - 
Port 1 


1 


RdOnly 


Set if Port 1 is a member of partition 0 


Partition 0 - 
Port 2 


1 


RdOnly 


Set if Port 2 is a member of partition 0 
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Partition 0 - 


1 


RdOnly 


Set if Port 3 is a member of partition 0 




Port 3 










Partition 1 - 


I 


RdOnly 


Set if Port 0 is a member of partition 1 




PortO 








5 


Partition 1 - 
Port 1 


j 


RdOnly 


Set if Port 1 is a member of partition 1 




Partition 1 - 


! 


RdOnly 


Set if Port 2 is a member of partition 1 




Port 2 










Partition 1 - 


! 


RdOnly 


Set if Port 3 is a member of partition 1 


i 10 


Port 3 








0 


Partition 2 - 


! 


RdOnly 


Set if Port 0 is a member of partition 2 ! 


◦ 


PortO 








■■-■r- 

m 
•£ 

jlS ! 


Partition 2 - 


1 


RdOnly 


Set if Port 1 is a member of partition 2 


Portl 








!U 

UlS 


Partition 2 - 


! 


RdOnly 


Set if Port 2 is a member of partition 2 


a 
a 


Port 2 








Partition 2 - 
Port 3 


! 


RdOnly 


Set if Port 3 is a member of partition 2 


a 


Partition 3 - 


! 


RdOnly 


Set if Port 0 is a member of partition 3 




PortO 










Partition 3 - 


1 


RdOnly 


Set if Port 1 is a member of partition 3 




Port 1 










Partition 3 - 




RdOnly 


Set if Port 2 is a member of partition 3 




Port 2 








25 


Partition 3 - 
Port 3 




RdOnly 


Set if Port 3 is a member of partition 3 



Table 5 provides the definitions of the port flag register. 

Table 5 
Port Flag Register 



30 



Name 


Bits 


Read/Write 


Definition 
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Port 0 Flag 


8 


Rd/Wr 


These bits can be written only by port 0, but can be 
read oy any pon. 


Port 1 Flag 


8 


Rd/Wr 


These bits can be written only by port I, but can be 
read bv anv nort 


Port 2 Flag 


8 


Rd/Wr 


These bits can be written only by port 2, but can be 
read by any port. 


Port 3 Flag 


8 


Rd/Wr 


These bits can be written only by port 3, but can be 
read by any port. 



□ 
a 

!SSS 

Lfl 

ill 
Ui 

a 

u 
O 

□ 



Table 6 illustrates the port identification register. 

Table 6 
Port ID Register 



Name 


Bits 


Read/Wr 
ite 


Definition 


Port ED 


2 


Rd 


Returns the ID of the port to which the access was 
made. 



10 Tag and address crossbar 70 Control and Status Registers (CSRA) hold initialization, 

configuration, control, and status information relating to various features and operations within 
tag and address crossbar 70. They all may be accessed either by scan or from control agent 
66 through a serial CSR interface to tag and address crossbar 70. CSRs are either port 
specific or global. Port specific registers relate to a particular bus 73 port. Each port has its 
1 5 own set Port specific registers are further divided into performance counters and kernel 

categories. The kernel category consists of configuration and diagnostic registers, for which 
access should be restricted. Each category is aligned on a 16k page boundary. Global 
registers pertain to Tag and address crossbar 70 as a whole. There is only one copy of a 
global register, which is seen in the address space of each of the ports. 
20 Each quad 58 in the system 60 has 64 kb of tag and address crossbar 70 CSR 

memory space, each at a specific address range. These addresses appear in the address 
space of all quads in a partition, and are therefore referred to as Shared CSR addresses. In 
addition, each quad 58 has 64 kb of local tag and address crossbar 70 CSR memory space. 
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Accesses to Local CSR memory space axe routed to the CSRs associated with the physical 
port that receives the access. Therefore, registers are only accessible at the Local CSR address 
through the port to which they are local. 

The global register block logic is shown in Fig. 4. In addition to holding the global 
category of CSR registers, the Global Register block 100 performs other logical services. It 
will check for partition inconsistencies, where a port is being configured to have the same 
logical ID as a Port that is already committed to the partition. P*logID is the two bit logical ID 
value programmed for each port. P*parNum is the two-bit partition number programmed for 
each port. Only ports which are already committed to a partition are considered. The error is 
flagged when the duplicate P*parNum and P*logID are written and the P*Commit bits are set. 
The CSR Global Register block is also responsible for generation of the partition vector 
(parvec), the 16-bit vector that collectively summarizes the status of all bus 73 ports that are 
included in the four possible partitions in the system shown in the preferred embodiment. It is 
used internally to tag and address crossbar 70 and is also visible as a CSR register. There is a 
programmable 2-bit field for each bus 73 port that contains the number of the partition to which 
the port belongs. These port references are listed in Table 7. 



Table 7 
Port References 



Port Partition # 


Vector 


00 


0001 


01 


0010 


10 


0100 


11 


1000 



Fig. 9 illustrates part of the global configuration register used to configure system-wide 
25 parameters. Fig. 9 is truncated for clarity, as it illustrates only the definitions of bit numbers 28 
through 3 1 inclusive shown as group 87 in Fig. 9. Each port of system 60 has its own 
individual P*WrPROT bit to which only that port can write. All ports of the system can read 



a 



iil 

. « 
111 

: : i 



10 



□ 

a 
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a 
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the entire register 88. All bits in this register other than those in the write protect bit group 87 
are write protected if any of the ports have set their P*WrPROT bit. In register 88, the first 
port to write to the register and set its write protection bit establishes the configuration, 
thereafter, only changes can be made with the cooperation of all ports. In the preferred 
5 embodiment, the register can be read to determine which ports have their write protect bits set 
if it is desired to read the status of the write protect bits 87. It will be appreciated that since 
register 88 is illustrated in Fig. 9 as a truncated version of the entire register in that the register 
is comprised of 32 bits, the remaining bits of register 88 down to and including bit 0 have other 
functions in the system, not directly related to the partition write protection system described 

1 0 herein. It will be further appreciated that the P* WrPROT bits are functionally distinct and 
independent from the partition write protect bit P*PartWrProt bits 77 and the port write 
protect bits P*PortWrProt bit 78. 

Fig. 8 illustrates port 0 partition configuration register 89. Register 89 is also a 32 bit 
register used to store information about each port's configuration relative to its partition 

1 5 assignment As each physical port in the system has a partition configuration register equivalent 
to register 89 in Fig. 8, it can be appreciated that the description of register 89 also describes 
the other three configuration registers for ports 1, 2, and 3 in the embodiment disclosed. Each 
configuration register is located at a separate local control and status register (CSR) address. 
All the partition configuration registers for all four ports may be accessed from any port in the 

20 system. As described in Fig. 8, configuration register 89 is used to specify information about 
crossbar 70's physical port 0, shown on Fig. 1, and the partition to which port 0 belongs. Port 
O's connections to crossbar 70 are shown at input 40 and output 45 in Fig. 1. 

A partition write protect bit, POPartWrProt 77, and a port write protect, 
POPortWrProt 78, are used to restrict write access and protect partitions from non-member 

25 quads (via their ports) in the system. In the invention, a restriction is provided that a specified 
bit must be in a certain state in order to enable writing of another bit, and such state must exist 
before a write is successful in changing the state of that bit. In general, either write protect bit 
does not restrict writes by the port that owns the Partition Configuration register that is the 
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# 



target of the write (or the target port, in this case, port 0), except that POPartWrProt 77 can 
only be written by a port that is member of the target port's committed partition. 
POPartWrProt 77 restricts writes of bits 91, 79, and 93 by non-owner ports, all under certain 
circumstances. POPortWrProt 78 restricts writes of bits 91, 78, 79, 92, and 93 by non-owner 
5 ports, all under certain circumstances. Each bit or field that is protected by these two write 
protect bits does so under the following circumstances: 

• POEnable 91 is protected from writes by non-owning ports when POPortWrProt 78 is active, 
and is protected from writes by non-members of its committed partition when POPartWrProt 
77 is active. 

1 0 ■ POPartWrProt 77 is protected from writes by ports that are non-members of the target port's 

1SSSS 

l.n committed partition. 

*iD 

ji j ■ POPortWrProt 78 is protected from writes by any non-owner port if it is already active. 

\,i 

™ r • POCommit 79 is protected from writes by any non-owner port if POPortWrProt 78 is active, 

or writes by non-owner ports that have their port's PartWrProt active and have a different 

{rab 

O 15 ParNum. 

tD 

□ • POlogID 92 is protected from writes by any non-owner port if POPortWrProt 78 is active. 

ffj 

■ POParNum 93 is protected from writes by any non-owner port if POPortWrProt 78 is active, 
or writes alter it has been committed (by activating POCommit 79) while any other port has its 
PartWrProt active and has the same ParNum as being written here (this blocks a committed 
20 port from simply changing its ParNum). 

POenable bit 91 is a bit that enables port 0 subject to certain conditions. If bit 91 is 
clear, port 0 in the system is disabled and will not respond to port 0 inbound transactions. 

POlogID 92 is a two bit field (3:2) which identifies the logical node ED of crossbar 70 
physical port 0, being shown as the bus comprised of input 40 and output 45 in Fig. 1. Bit 92 
25 is used in crossbar 70's input block to determine whether port 0 is the target of a transaction. 

POCommit bit 79 determines whether port 0 is to be part of the partition indicated by 
field 93, which is POParNum. When bit 79 is cleared, then port 0 is not considered to be part 
of any partition. It could be appreciated that if the POCommit bit 79, or its equivalent in the 
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partition configuration registers for other ports is not set, it is possible that a particular port will 
not be assigned to any partition in the system and therefore could be disabled if desired 
Therefore, the invention allows selection of one or more nodes to operate within a given 
partition, while disabling one or more nodes to be dormant in a given configuration of the 
5 system. 

Advantages 

With the above, a complete disclosure for a system to allow different operating systems 
to co-exist on the same physical computer system has been demonstrated with details 
implementing the invention in a preferred embodiment Further, specific implementation of the 

1 0 method and the apparatus disclosed enable inter-partition data passing to utilize existing high 

performance data connections in a multi-node system such that the input/output subsystems can 
remain a shared hardware resource. It should be appreciated that with the system and method 
disclosed it is possible to obtain significant data protection across partitioned resources without 
utilizing each of the subsystems demonstrated in the detailed disclosure above and the various 

1 5 drawings and figures provided. 

Alternatives 

The system is not necessarily limited to the specific number of processors or the array 
of processors disclosed, but may be used in similar system designs using interconnected 
memory control systems with tagging, address crossbar and data crossbar systems to 
20 communicate between the controllers to implement the present inventioa Accordingly, the 
scope of the present invention fully encompasses other embodiments which may become 
apparent to those skilled in the art. Furthermore, the scope of the present invention is to be 
limited by nothing other than the appended claims. 

CLAIMS 
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